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THE  DISTRIBUTION  OF  EXTREME  VALUES  OF  SAMPLES  DRAWN  FROM 
A  GAUSSIAN  POPULATION 

by 

Maurice  J.  Daintith 


ABSTRACT 


For  some  applications  it  is  necessary  to  know  the  probability  distribution 
functions  of  the  maximum  value  (either  absolute  or  else  without  regard 
to  sign)  occurring  in  large  samples  drawn  from  a  parent  stochastic 
population.  Exact  egressions  are  derived ,  and  in  addition  useful  approx¬ 
imations  and  limiting  forms  are  presented;  these  also  bring  out  clearly 
some  interesting  properties  of  the  distributions . 


INTRODUCTION 


If  samples  of  size  n  are  drawn  from  a  gaussian  population,  two  extreme 
values  can  be  identified  within  each  sample. 

Two  types  of  extremum  may  be  defined.  The  first  is  the  true  maximum 
value.  This  would  be  the  appropriate  parameter  for  use  in,  say,  designing 
a  sieve.  The  second  type  is  the  maximum  deviation  from  the  mean.  This 
would  be  natural  in  investigating  noise  peaks.  This  memorandum  deals 
mainly  with  the  second  of  these  distributions. 

The  purpose  of  this  memorandum  is  to  investigate  the  statistical  properties 
of  these  extremes,  present  numerical  values,  and  derive  some  approximate 
expressions  for  practical  use. 


1  NOTATION,  FUNDAMENTAL  EQUATIONS,  AND  BASIC  EXPRESSIONS 

1.1  Notation 

The  notation  adopted  is  that  used  by  Abramovitz  and  Stegun  [ij .  Throughout, 
the  parent  population  will  be  taken  as  having  zero  mean  and  unit  standard 
deviation. 


1 .2  Fundamental  equations 


The  gausslan  probatlllty  density  function  Is 

Z(x)  ■  exp(-2$z2)/v''“2iF 

(Eq.  1) 

and  cumulative  probability  functions  are 

P(z)  =  /2  Z  dz 
-00 

(Eq.  2) 

Q(z)  =  1  -  P(z)  =  rz  1  dz 

(Eq.  3) 

A(z)  *  2P(z)-l  =  l-2Q(z)  =  2  /2  Z  dz  , 

(Eq.  4) 

(z  positive) 

so  that 

Z  =  3P/3z  =  -3Q/3z  *  %3A/  3z 

(Eq.  5) 

Further  useful  relationships  are 

3Z/3z  =  -zZ 

(Eq.  6) 

and 

P(-z)  =  Q(z)  =  l-P(z)  . 

(Eq.  7) 

1 .3  Basic  expressions 

Consider  a  sample  of  n.  Cn(x),  the  cumulative  probability  function  of 

x,  where  x  Is  the  greatest  value  of  |z|  in  the  sample,  can  be 

Immediately  written  down.  Cn(x)  is  the  probability  that  all  members  of 

the  sample  lie  between  ±  x.  For  each  member  of  the  sample,  the  probability 

is  P(x)  -  P ( —x ) ,  which  from  Eq.  7  is  2P(x)-l,  and  from  Eq.  4  is 

A(x).  Therefore  for  a  sample  of  n 

Cn(x)  =  An(x)  .  (E  (Eq.  8) 


The  probability  density  function  pn(x)  =  3Cn(x)/3x,  which,  using 

Eq.  5,  becomes 

pn(x)  =  2nZ(x)  An_1(x)  .  (Eq.  9) 
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The  maximum  likelihood  value  of  x  Is  found  by  setting  3p  (x)/3x  ■  0. 
Differentiating  Eq.  9,  and  using  Eq.  6,  yields 

2(n-l )Z(xi )  =  XiA(xi)  ,  (Eq.  10) 

which  Is  a  transcendental  equation  yielding  Xi  as  a  function  of  n. 

The  mean  value  x,  Is,  from  Eq.  9, 

x  -  2n  f*  xZ(x)  An-1(x)  dx  (Eq.  11) 

and  higher  moments  may  be  similarly  defined. 


2  EXACT  NUMERICAL  RESULTS 


2.1  Confidence  limits 


Given  a  confidence  limit  C 


A(xc) 


/ 
C 


from  Eq.  8 


(Eq.  12) 


and  from  published  tables  of  P(x)  [%(1+A(x))],  or  by  using  an  appropriate 
approximation,  [e.g.  [ij  Ch.  26  para  2.22],  x£  may  be  readily  determined. 

Figure  1  gives  plots  of  xQ  as  a  function  of  n  over  the  range  n  =  10 

to  n  *  10®,  for  values  of  C  of  0.99,  0.95,  0.5,  and  0.05.  The  value  for 
C  *  0.5  Is,  by  definition,  the  median,  xm. 

Two  features  of  these  curves  are  noteworthy.  In  the  first  place,  for 
all  of  them  x  Is  a  very  slowly  varying  function  of  n.  For  example, 
the  median  value  Increases  from  1.83  to  only  4.97  over  a  five  decade 
range  of  n . 

Secondly  the  50%  value  does  not  He  midway  between  the  5%  and  95%  values. 
Indicating  that  the  distribution  Is  markedly  skew.  The  extent  of  the 
skewness  can  be  appreciated  from  Fla.  2,  which  shows  plots  of  the 
probability  density  function  (Eq,  9)  for  values  of  n  of  102,  104  and 
10*.  Equation  9  shows  Immediately  why  this  should  be  so.  The  factor  Z 

n  1 

In  that  expression  Is  multiplied  by  A  .  Although  A  Itself  Is  very 
close  to  unity  over  the  whole  range  of  Interest,  when  It  Is  raised  to 
the  high  power  of  n-1  It  varies  very  rapidly,  depressing  the  probability 
for  the  lower  values  of  x. 
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2.2  Maximum  likelihood 


Equation  10  may  be  written  as 

exp(-%x2)  *  XjA(xi)  / ir/2/(n-l)  ,  (Eq.  13) 

and  In  this  form  Is  readily  solved  Iteratively,  since  the  exponential 
term  varies  much  more  rapidly  than  the  terms  on  the  right-hand  side. 
[Note  that  A(xi)  Is  here  only  to  the  first  power,  and  differs  signif¬ 
icantly  from  unity  only  for  the  smaller  values  of  n;  the  lowest  value, 
for  n  a  10,  Is  0.92.  It  Is  therefore  justifiable  to  use  a  reasonable 
approximation  for  A(xi);  a  suitable  one  Is  given  In  [l]  Ch.  26, 
para  2.16.] 

The  resulting  values  for  the  maximum  likelihood  value  —  the  mode — 
are  plotted  in  Fig.  1.  They  run  nearly  parallel  to,  and  not  far  from, 
the  median  (50%  confidence  limit).  This  suggests  that  these  central 
values,  despite  the  non-gausslan  distribution,  are  useful  parameters. 


2.3  Mean 


Equation  11,  for  the  mean  value,  cannot  be  evaluated  analytically. 
Numerical  Integration  would  be  a  possibility,  but  It  seemed  preferable 
to  search  for  an  adequate  approximation  that  would  give  some  feeling  for 

the  way  that  x  varies  with  n.  An  approximation  sufficiently  good 
over  a  part  of  the  range  (n  =  102  to  l(r)  Is  described  In  Ch.  3.  The 
results  are  shown  as  the  dotted  line  In  Fig.  1.  There  Is  a  reasonable 
coincidence  with  the  median  curve,  once  again  supporting  the  usefulness 
of  the  central  values. 


2.4  Variance 


In  principle,  the  variance  would  be  obtained  by  evaluating  the  mean 

square  value  of  x[=x2]  ,  and  the  mean  value  x,  and  calculating  the 
variance  as 

7  -  X2 


However,  neither  quantity  can  be  calculated  exactly,  and  since  the 
difference  turns  out  to  be  small  compared  with  the  two  mean  values,  the 
approximation  used  for  calculating  the  mean  Is  not  adequate^  In  addition, 
even  In  u?'  .g  the  same  type  of  approximation  to  calculate  x2,  the 
result  arrears  as  a  small  difference  between  two  large  quantities.  For 
this  reason,  and  since  anyway  the  distribution  Is  so  skew,  this  approach 
was  abandoned. 
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Instead,  I  have  used  a  quantity  which  I  refer  to  as  the  semi-spread,  s  , 
defined  as  follows. 


Using  Eq.  12,  compute  x  for  the  confidence  limits  of  C  =  0.84134  and 
0.15866.  Half  the  difference  between  these  x-values  Is  defined  as  the 
semi-spread,  s.  If  the  distribution  had  been  gausslan,  s  would  In  fact 
have  been  the  standard  deviation.  In  any  event,  it  defines  a  range 
within  which  68%  of  the  x-values  lie. 

The  quantity  s  Is  plotted  In  Fig.  1.  It  decreases  with  n  ,  although 
very  slowly,  demonstrating  the  central  tendency  In  the  distribution  (as 
evident  In  Fig.  2).  The  small  value  of  s  compared  with,  say,  xi  or 
xm  shows  that  these  latter  values  are  reasonably  well-defined,  and 

therefore  of  potential  value. 

Some  confirmation  that  the  semi-spread  s  is  a  meaningful  value  in 
defining  the  precision  of  the  distribution  was  obtained  by  finding  the 
ratio  of  the  range  between  the  5%  and  95%  values  of  x  to  the  whole 
spread  2s.  Over  the  whole  range  of  n  from  10  to  10s,  this  ratio  was 
nearly  constant  (1.66  to  1.70),  indicating  that  the  shapes  of  the  curves 
were  sensibly  the  same.  (It  is  of  Interest  that  the  value  of  the  ratio 
for  a  truly  gausslan  distribution  is  1.64,  not  markedly  different  from 
the  figures  quoted  above.) 


3  APPROXIMATIONS 


The  aim  of  finding  simple  approxtmat1  -* 
previously  calculated  is  partly  to  prr' 
and  partly  to  give  an  easy  understanding 
meters  vary  with  n. 


he  statistical  quantities 
quick  method  of  calculation, 
he  way  In  which  the  para- 


3.1  Median 


From  Eq.  12,  the  median  value  (c 
from 


A(xm)  = 


(%) 


V 


n 


k)  is  determined  by  finding  xm 


(Eq.  14) 


Now  over  the  range  considered,  A  is  not  much  less  than  unity,  so  that 
Q(xm)  =  %(1-A)  Is  a  small  quantity. 

Hence,  since  A  =  1-2Q,  In  A  %  -  2Q,  and,  from  Eq.  14, 

-2Q^l£n2,  (Eq.  15) 

or 

Q(xm)  %  £n2/2n 
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The  problem  now  is  to  find  xm  from  Eq.  15.  In  paras  2.22  or  2.23  of 

Ch.  26  of  [1]  It  is  shown  that,  with  considerable  accuracy,  x„  Is  a 
- - -  m 

function  of  /£n(l/Q2),  so  that,  from  Eq.  15,  x2  is  a  function  of 
£n(n).  Plotting  x2  against  In  n  produced  a  nearly  straight  line 
almost  passing  through  the  origin,  suggesting  that  a  crude  approximation 

would  be  x  «  /Tn  n.  An  obvious  attempt  to  find  a  better  approximation 

m  q 

would  be  to  try  a  modified  power  law,  viz:  nr  .  or,  taking 

logarithms.  In  xm  =  <*+g£n£n  n,  a  linear  relationship.  A  plot  of 
In  xm  against  Inin  n  produced  a  straight  line  to  within  the  limits  of 
graph  paper  accuracy. 

Accordingly,  a  linear  regression  curve  was  fitted  to  In  xm  as  a  function 

of  £a£nn  (using  values  of  n  of  10,  20,  50,  100,  200,  .....  106). 

The  result  was 

In  xm  =  0.1397  +  0.5593  Inin  n  (Eq.  16) 

■with  the  very  high  coefficient  of  determination  of  0.99995. 

Equation  16  yields 

xm  =  1.1499(£n  n)0’5593  (Eq,  17) 

Equation  17  predicts  the  value  of  xm  over  the  whole  range  of  n  to  an 

accuracy  of  better  than  0.3%  (i.e.  of  at  most  two  units  in  the  second 
decimal  place). 

It  will  be  noted  that  this  differs  only  slightly  from  the  square  root 
'first  guess' . 

3.2  Mode 

Since  the  modal  curve  so  closely  parallels  the  median  (Fig.  1)  it  is 
natural  to  try  the  same  type  of  approximation.  The  straight  line 
regression  fit  yielded 

In  xi  =  0.0732  +  0.5795  Inin  n 

with  a  coefficient  of  determination  of  0.99986,  so  that 

Xi  =  1 .0760(£n  n)0’5795  (Eq.  18) 

The  errors  from  Eq.  18  are  at  most  about  0.7%  (about  three  units  in  the 
second  decimal  place). 
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3.3  Semi -spread  s 

The  semi -spread  Is  defined  as  %(xjj-xa),  where 
A(xa)  -  C.V"  ,  A(xb)  =  Cb‘/n 

*  0.15866,  Cb  *  0.84134.  Writing  A  =  1-2Q  and  approximating,  as 
above, 

Q(xa)  'v  ^nCfl)/n,  Q(xb)  'v-  UnCb)/n. 


Now  try  the  very  crude  approximation 
x2  -v  -2£nQ  , 

whence 

x£  -  x2  -v  2  ^n(Qa/Qb)  ^  2  £R[UnCa)/.enCb]  =  4.73  . 


But  xk  -  x  =  2s,  and  xk  +  x.  'V/  2x_  ,  whence  sx_  ^  1.18. 

b  a  bam  m 


This  relationship  is  not  too  badly  astray,  since,  using  the  calculated 
values,  it  turns  out  that  sx  varies  only  from  0.92  to  1.13  over  the 
whole  range  of  n,  while  s  mitself  varies  by  a  factor  of  2.2. 

The  obvious  suggestion  is  therefore  to  try  an  expression  of  the  form 

g 

s  *  °c  (£n  n)  ,  in  the  same  way  that  xm  was  fitted.  As  might  be 
expected,  the  approximation  is  worst  for  small  n,  and  it  appeared  more 
satisfactory  to  omit  the  value  n  =  10,  starting  from  n  =  50.  The 
result,  with  a  coefficient  of  determination  of  0.9989,  is: 

s  =  0.787  (£n  nf0'469 

In  which  the  errors  are  <  1%  for  n  >  50,  becoming  2%  for  n  =  20,  and 
5.5%  for  n  =  10. 


3.4  Mean 


The  evaluation  of  Eq.  11,  repeated  here  for  convenience,  presents  some 
difficulties 

x  =  2n  /“  xZ(x)  An"^(x)dx  .  (Eq.  11 ) 


Straightforwardly,  one  would  endeavour  to  find  an  approximation  for 
A(xJ  that  would  allow  Eq.  11  to  be  Integrated.  However,  A  Is  raised 
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n  ^  1 

to  a  very  high  power  for  large  n,  and  the  errors  In  A  would  be 
Intolerable.  However,  Eq.  11  may  be  transformed  Into  a  more  malleable 
form  as  follows.  Noting  that  xZ  =  -3Z/9x  [Eq.  6],  Eq.  11  may  be 
Integrated  by  parts,  yielding 

x  =  -2nZ  An_1  /“  +  2n(n-l )  /“  ZAn’2  ^  dx 


The  first  term  vanishes  at  both  limits,  and  in  the  integral  It  Is 
legitimate  to  use  A  as  the  variable  of  integration  (since  A  is  a 
monotonic  function  of  x). 

Thus  x  =  2n(n-l )  f'  Z  An"2  dA  (Eq.  19) 

Now  in  Eq.  19  if  one  can  find  an  approximation  for  Z,  as  a  function  of 
A  ,  that  is  correct  to  a  few  percent  over  the  region  of  importance 

(which  from  Fig.  2  is  confined  to  a  restricted  range  of  values),  x 
can  be  determined  to  about  the  same  accuracy  (since  Z  is  raised  to  the 
first  power  only). 

The  problem  is,  then,  to  find  an  approximation  to  Z  as  a  function  of 
A.  Not  only  must  it  be  of  adequate  accuracy,  but  it  must  also  be  of  a 
form  permitting  Eq.  19  to  be  integrated.  For  example,  an  approximation 
can  be  found  for  x  in  terms  of  In  Q  i.e.  in  terms  of  £n(l-A),  and 

since  Z  =  1//2tt  exp(-%x2),  this  will  give  Z  as  a  somewhat  complicated 
exponential  function  of  fn(l-A).  But  this  does  not  lead  to  an  integrable 
form. 

A  possible  solution  is  suggested  as  follows. 

Para  2.4  of  Ch.  26  in  [l]  gives  an  upper  bound  (which  is  also  quite  a 
good  approximation)  to  P(x)  ,  viz: 

P(x)  =  %[1  +  (l-exp(-2x2/n))'5] 


or 


A2(x)  =  l  -  exp(-2x2/u) 


so  that,  if  the  inversion  were  permissible, 

Z  =  (l//"2rr)  exp(-%x2) 

^  (l//"2rr)  [1-A2f/4 

However,  this  is  a  poor  approximation.  Over  the  range  2  <  x  <  4, 
errors  of  as  much  as  13%  in  Z  are  given  by  this  expression.  It  does 
suggest  however  that,  at  least  over  a  limited  range,  one  may  approximate 
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■ay  approximate  z  In  the  form  Z  ^  «[1-A2]^,  where  «  and  $  are 
constants.  Taking  logarithms,  this  Implies  that  x2  Is  a  linear 
function  of  £n(l-A2).  It  was  found,  In  practice,  that,  over  the  range 
2  <  x  <  4, 

-  x2/2  'v.  0.2532  +  0.9164  £n(l-A2}, 
or 

Z  ■  0.5139(1-A2)0,9164  . 

with  errors  <3%  over  the  whole  range. 

Applying  this  approximation  to  Eq.  19  yields 

x  'v  2n(n-l )  x  0.5139  /* (1-A2)0*9164  An“2  dA  . 

Writing  A  ■  t% 

x  *  0.51 39n(n-l )  /‘(l-t)6t(n"3)/2dt,  (Eq.  20) 

where  6  *  0.9164  . 

But  the  Integral  Is  a  known  beta  function 

x  »  0.5139  n(n-l)  B(l+$,^)  , 

where 

B(l+$  ,(n-l )/2)  =  |  (1+3)  | (n-1 )/2  j  |  (n+l+26)/2  (Eq.  21) 

Expression  21  can  be  considerably  simplified,  using  Stirling's  approximation 
for  the  two  gamma  functions  involving  n. 

Neglecting  terms  of  the  order  1/n  (and  noting  £n(1-l/n)<v  -  1/n)  , 

In  |(n-l)/2  'v  klnfa)  +  (n/2-l)[£n(n-l)  -  £n  2]  -  n/2+% 

Jj£n(2ir)  +  (n/2-1 )  -Zn(n/2)  -%  -n/2  +% 

Similarly, 

In  (n+l+2$/2)  £n(2ir)+(n/2+$  )  £n(n/2)+%  (1+2$  )  -  n/2  -%-$. 

whence  ^  j(n_i)/2)  /  |  (n+l+2$)/2)] 

*  -(l+$)£n(n/2)  , 


so  that  B(l+e»  (n-lJ/2)  ^  |  (1+6)  (n/2)'^+^ 
and  from  Eq.  20 

x  =  0.5139  |(l+6)  (n-1)  n’e  2e  . 

|l+6  ■  0.9678,  so  the  final  result  Is 
x  »  1.877  x  (n-l)/n0,9164 
or,  for  large  n  ,  x  ^  1.877  n°'0836 

It  Is  this  equation  that  hasbeen  used  over  the  range  shown  tn  Ftg.  1  to 
calculate  a  rough  value  of  x  .  Errors  of  several  units  In  the  second 
decimal  place  would  be  expected. 


4  THE  ABSOLUTE  MAXIMUM 


If  Interest  lies  In  the  distribution  of  the  maximum  value  of  x,  taking 
account  of  sign,  a  similar  treatment  to  that  for  the  modular  values 
presents  no  extra  problems.  Evidently  the  cumulative  distribution 

function  would  be  C(x)  -  Pn(x)  (where  x  can  now  range  between  +» 
and  -«),  and  the  distribution  function  would  be,  by  differentiating, 

p(x)  =  nPn-1Z. 

However,  It  Is  possible  to  obviate  the  whole  problem  by  the  following 
considerations.  Evidently,  we  are  interested  only  tn  the  range  of 
values  over  which  K(x)  differs  significantly  from  zero,  which  we  may 
plausibly  expect  will  be  the  range  over  which  C(x)  is  not  too  far 
removed  from  0.5.  Furthermore,  we  would  expect  P(x)  and  A(x),  for 
large  n  at  any  rate,  to  be  not  much  smaller  than  unity;  i.e.  Q(x) 
will  be  small . 

Now  since  P  =  1-Q,  and  A  =  1-2Q,  we  may  expand  the  two  cumulative 

functions  Pn  and  An  by  taking  logarithms.  Retaining  two  terms  In  the 
expansions 

InP'v-  Q+J{Q2 

£n  A  'v  -2Q+2Q2  , 

whence  _ 

In  Pn  n.  -  nQ+JjnQ2 

in  An  'v.  -2nQ+2nQ2  . 
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Substitute  2n  In  the  expression  for  In  Pn 
In  P2n  -v  -2nQ+nQ2 


and  hence 

£n[P2n/An]  *  -nQ2 


Now  nQ  Is  not  too  far  removed  from  0.5  and  therefore  at  maximum  Is, 
say,  unity; 

so  that  £n[P2n/An]  Is  of  the  order  of  -Q 

or  P2n/An  'v  e"^  ^  1-Q,  and  since  Q  Is  small  P2n  'v  An. 

The  Implication  of  this  Is  that  the  distribution  of  absolute  values  Is 
almost  exactly  the  same  as  the  distribution  of  modular  values  for  a 
sample  of  half  the  size.  In  other  words,  the  curves  of  Figs.  1  and  2 
may  be  used  to  estimate  absolute  values  If  entered  with  n  ■  half  the 
number  In  the  sample. 

A  few  trials.  In  fact,  showed  that  this  holds  true  even  down  to  n  ■  10 
to  an  accuracy  of  at  least  three  decimal  places. 

This  Is  not  an  unexpected  result,  noting  that,  for  each  sample  of  n, 
selecting  the  greatest  modulus  Is  equivalent  to  selecting  the  largest 
value  from  either  the  positive  values  or  the  negative  values,  while  the. 
selection  for  the  absolute  maximum  will  involve  generally  only  the 
positive  values  (the  change,  In  a  large  sample,  of  the  true  maximum 
being  negative  can  be  neglected  for  all  except  very  small  values  of  x). 
Hence,  for  the  absolute  maximum  we  normally  can  Ignore  all  the  negative 
values,  which  will  constitute  about  one-half  of  the  total  sample. 


5  A  GENERAL  LIMITING  FORM  FOR  LARGE  SAMPLES 


In  the  course  of  this  small  study,  I  observed  a  very  Interesting  limiting 
form  that  holds  for  a  large  class  of  parent  distributions.  The  observation 
Is  not,  however,  new,  since  It  appears  In  [i]  Ch.  26,  para  1.30  as 
", Extreme-Value  (Fisher-Tippett  Type  lt  or  doubly  exponential)" .  The 
reference  to  the  original  work  Is  not  readily  available,  so  that  I  do 
not  know  If  the  derivation  I  give  below  corresponds  with  that  of  Fisher 
and  Tippett. 

Consider  a  parent  distribution  Z(x),  In  which.  Initially,  the  only 
requirement  Is  that  Z  approaches  zero  asymptotically  as  x  tends  to 
Infinity.  The  cumulative  probability  P(x)  Is  defined  as  usual,  by 


*  P  ■  /*  Z  dx  (Eq.  22) 

—CP 
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For  the  absolute  maximum  of  a  sample  of  n  ,  the  cumulative  probability 
as  before 

C(x)  =  Pn(x)  (Eq.  23) 

and  the  probability  distribution 

p(x)  *>  3C/3 x  (Eq,  24) 

Define 

y  =  In  C  =  n  In  P  (Eq.  25) 

Since  C  -*■  1 ,  y  -*■  0  as  x  ~ 
differentiating  Eq.  25,  and  using  Eqs.  22  and  24, 

3y/3x  =  p/C  =  nZ/P  .  (Eq.  26) 

and,  differentiating  again 

32y/3x2  =  (3p/3x)/C  -  p2/C2  . 

.  =  n(3Z/3x)/P  -  nZ2/P2  (Eq.  27) 

Eliminating  P  between  Eqs.  26  and  27  yields  the  two  equations 

32y/3x2  -  (3£n  Z/3x)(3y/3x)  +  (3y/3x)2/n  ■  0  (Eq.  28) 

and 

(3p/3x)/C  =  (3y/3x)2+(3£n  Z/3x)j[3y/3^-(3y/3x)2/n  ,  (Eq.  29) 


Now  consider  the  maximum  value  of  p  (the  modal  value),  and  let  this 
occur  for  x  =  «  (this  was  denoted  by  x.  in  previous  sections).  From 
Eq.  29  this  will  be  given  by 

(3y/3x)  (1-1  /n)  =  -{Un  Z/3x)  .  (Eq.  30) 


Since  Z  decreases  with  x  ,  by  hypothesis,  the  right-hand  side  of 
Eq.  3  Is  positive,  and  will  be  written  as  (1/8),  where  $  Is  a  positive 
quantity.  Since  we  are  considering  very  large  n  ,  the  term  1/n  on 
the  left-hand  side  may  be  dropped,  yielding 

(3y/3x)a  =  -(3£n  Z/3x)a  =  1/e  .  (Eq.  31) 
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Equation  28  may  be  written  as 

32y/3x2  +  (3y/3x)  [-3£n  Z/3x  +  (3y/3x)/n]  ■  0. 


The  limiting  form  of  this  equation  for  large  n  becomes 
32y/3x2  -  (3y/3x)  (3£n  Z/3x)  *  0 


(Eq.  32) 


Again,  since  p(x)  is  significantly  greater  than  zero  over  only  a 
limited  range,  about  x  «  «,  we  may  insert  the  central  value  of  the 
slowly  varying  factor  (3£n  Z/3x)  in  Eq.  32,  i.e,  from  Eq.  31,  the 
constant  -1/8.  Equation  32  thus  becomes 

32y/3x2  +  1/8  3y/3x  =  0 
which  Integrates  immediately  to 
3y/3x  =  K  exp(-x/8)  . 

To  find  K  ,  put  x  =  «,  then,  using  Eq.  31 
1/8  =  Kexp(-«/8), 


yielding 


3y/3x  *  1/6  exp  -[(x-«)/8j  , 


(Eq.  33) 


Equation  33  also  integrates  immediately  to 
y  =  -  exp  -[(x-«)/6]  , 


(Eq.  34) 


there  being  no  constant  of  integration  since  y-*-0  as  x-h» 


Hence 


C  =  ey  =  exp  (-e_t) 


(Eq.  35) 


where  t  =  (x-«)/6 
But  from  Eq.  26 


p  =  C3y/3x  =  (1/6)  exp(-t  -e_t) 


(Eq.  36) 


t  /V  * 
h'  »  -. 


1. 


Equation  36  is  the  expression  given  by  Fisher  &  Tippett  quoted  in  [l] . 
From  Eqs.  26  and  31,  *  is  obtained  from  the  transcendental  equation 

In  Z(«)/3«  +  n  Z(«)/P(«)  *  0  (Eq.  37) 
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and  from  Eq.  31 


1/p  =  -(3£n  Z(«)/3«)  (Eq.  38) 

If  the  distribution  is  symmetrical  about  the  mean,  the  same  type  of 
procedure  will  yield  the  distribution  of  deviations  from  the  mean,  the 
results  being 

C  *  exp(-e_t)  (Eq.  39) 

p  =  (1/3)  exp(-t  -e’*)  ,  (Eq.  40) 

where 

t  =  (x-«)/e  (Eq.  41) 

(3£n  Z(3)/3«)  +  2n  Z/(2P-1)  =  0  (Eq.  42) 

(1/6)  «  -(3£n  Z(«)/3«)  (Eq.  43) 


With  one  exception,  these  are  the  same  equations  as  for  the  absolute 
maximum.  The  exception  is  the  value  of  (3£n  Z(«)/3«)  (Eqs.  37  and  42; 
viz  nZ/P  and  2nZ/(2P-l)  respectively.  Since  for  very  large  n,  «  becomes 
very  large,  so  that  P(«)  tends  to  unity,  the  first  expression  tends 
to  nZ  ,  and  the  second  to  2nZ. 

This  Implies  that  the  distribution  for  the  absolute  maximum  has  exactly 
the  same  form  as  for  the  maximum  deviation  from  the  mean,  provided  that 
for  the  former,  the  tables  and  curves  are  entered  with  (n/2)  Instead 
of  n  .  This  confirms  the  finding  in  a  previous  section. 

For  a  gausslan  distribution,  Z  =  (1//"2tt)  exp(-^x2).,..£0  that,  from 
Eq.  43,  1/6““. 

Unfortunately,  this  limiting  expression  is  not  very  accurate,  even  for 
n  =  10s.  It  could  hardly  be  expected  to  be  so,  since  it  is  independent 
of  the  parent  distribution  (within  limits). 

There  is,  however,  the  advantage  that  the  mean,  median,  and  variance  are 
readily  computable. 

From  Ref.  1 , 

Mean  x  =  «  +  yB  (where  y  -  Euler's  constant  =  0.577 _ ) 

variance  a2  =  (tt6)2/6 


In  addition,  by  putting  C  =  \  in  Eqs.  35  or  39, 

median  xm  *  «  -(£n  £n  2)6  =  «  +  0.3665  6  • 

From  this,  it  appears  that  the  mean  and  the  median  are  very  nearly  equal, 
and  both  are  greater  than  the  mode.  This  compares  well  with  the  curves 
of  Fig.  1. 
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Finally,  for  a  gausslan  distribution,  where  3  =  l/« 
o«  ■  tt/ /?  =1.28 

This  may  be  compared  with  the  approximation 

sxm  =  1.18,  shown  earlier, 
m 

SUMMARY  AND  DISCUSSION 

The  main  results  are  summarized  as  follows: 

(1)  For  samples  of  n  ,  where  n  ranges  from  ten  to  one  million,  the 
following  approximations  may  be  used  to  estimate  the  statistical  parameters 
of  the  maximum  deviation  (the  modulus) 

(a)  The  most  likely  value  (the  mode)  is  given  by 

X!  %  1.076(£n  n)0,58 

(b)  The  median  value  is 

x  %  1.15(£n  n)°“56 
m 

(c)  The  semi-spread  (half  the  range,  which  includes  68%  of 
values)  Is 

s  %  0.79(£n  n)‘°-47 

(d)  The  probability  distribution  function  p(x)  can  be 
described  by  a  universal  curve  relating 

sp(x)  to  (x-xi)/s  . 

(2)  For  the  absolute  maximum  the  formulae  and  curves  for  the  modulus 
may  be  used,  by  entering  with  n  equal  to  half  the  number  in  the  sample. 

(3)  A  limiting  form  as  rv+«>,  which  holds  for  a  large  class  of  parent 
distributions  Z(x)  ,  is  that 

C  =  e_t 

p  =  exp[-t  -e‘t] 

where 

t  *  (x-«)/8. 


«  ,  the  modal  value  of  x  ,  Is  given  by 

3£n  Z(«)/3«  +  nZ  =  0  (absolute  maximum) 
3£n  Z(«)/3*  +  2nZ  =  0  (maximum  deviate) 
1/0  ■  -3£n  Z(«)/3« 


For  this  distribution 


the  mean 


x  =  *  +  yB  (y  =  Euler's  constant  =  0.577.. 


the  median  xm  =  a  -Un  2)/B 
the  variance  o2  =  (tt&)  2/6 . 


/  k  .  * 

•  I 


h-  • 


This  expression  is  not  as  accurate,  up  to  n  *  106,  as  the  empirical 
expressions  in  this  study. 

In  discussing  these  results  it  must  be  remembered  that  the  aim  is 
pragmatic  —  to  find  easily  handleable  expressions  for  the  distributions, 
not  elegant  theoretical  results. 

The  most  striking  fact  is  the  accuracy  with  which  parameters  of  the 
distributions  may  be  estimated  to  a  respectable  accuracy,  over  a  very 
large  range  of  sample  size,  by  the  use  of  very  simple  expressions.  The 
derivation  of  the  limiting  form  sheds  some  light  on  why  this  may  be 
expected. 

The  second  feature  is  that,  perhaps  contrary  to  intuition,  the  distribution 
of  extreme  values  turns  out  to  be  predictable  within  quite  narrow  limits. 
The  implication  justifies  the  common  use  of  confidence  limits,  particularly 
for  large  samples. 

A  third  feature  is  perhaps  of  interest.  In  general,  statisticians 
search  for  'efficient'  statistics  (in  the  Fisher  sense).  That  is,  they 
prefer  to  deal  with  statistics  that  give  the  maximum  possible  Information 
about  the  parent  distribution.  This  is,  of  course,  especially  true  of 
investigations  In  which  the  aim  is,  by  experiment,  to  determine  what  the 
parent  distribution  is. 

However,  the  inverted  problem,  that  of  finding  a  predictable  statistic, 
does  not  necessarily  mean  that  the  best  choice  of  statistic  is  an  efficient 
one.  Since  an  inefficient  statistic  gives  little  Information  about  the 
parent  distribution  it  follows,  conversely,  that  in  many  Instances  lack 
of  knowledge  of  the  parent  distribution  is  of  relatively  minor  importance. 
This  can  be  of  value  in  system  design. 

The  l4mit  of  this  approach  Is,  naturally,  to  find  the  currently  fashionable 
'distribution-free'  parameters.  Here,  in  a  noise  background,  for  example, 
it  has  been  shown  that,  for  large  samples,  the  distribution  of  the 
maxima  is  sensibly  Independent  of  the  nature  of  the  parent  distribution, 
and  will  be  scaled  according  to  the  variance.  This  Implies  that,  In 
detecting  a  signal  against  noise,  the  effect  of  noise  spokes  hardly  depends 
on  whether  the  noise  is  gaussian  or  of  some  other  distribution:  It  can  be 
predicted  for  all  practical  purposes  merely  from  a  knowledge  of  the 
mean  intensity  —  a  somewhat  unexpected  result. 
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